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5 ( 54 > ™ e: MULTIMERIC FORMS OF TNF SUPERFAMTLY LIGANDS 

00 (57) Abstract: A method for constructing stable bioactive fusion proteins of the difficult to express tumor necrosis factor superfamily 
OS (TNFSF), and particularly members CD40L (CD154) and RANKL/TRANCE, with collectins, particularly pulmonary surfactant 
J^j protein D (SPD) is described. Single trimers of these proteins lack the full stimulatory efficacy of the natural membrane forms of 
^ these proteins in many cases. The multimeric nature of these soluble fusion proteins enables them to engage multiple receptors on 
the responding cells, thereby, mimicking the effects of the membrane forms of these ligands. For CD40L-SPD, the resulting protein 
stimulates B cells, macrophages, and dendritic cells, indicating its potential usefulness as a vaccine adjuvant. The large size of these 
fusion proteins makes them less likely to diffuse into the circulation, thereby limiting their potential systemic toxicity. This property 
^ may be especially useful when these proteins are injected locally as a vaccine adjuvant or tumor immunotherapy agent to prevent 
^ them from diffusing away. In addition, these and other TNFSF-collecting fusion proteins present new possibilities for the expression 
^ of highly active, multimeric, soluble TNFSF members. 
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MULTXMERIC FORMS OF TNF SUPERFAMILY LIGANDS 
5 BACKGROUND OF THE INVENTION 



Field of the Invention 

The present invention relates to a method form preparing soluble 
multimeric proteins consisting of more than three iterations of the same bioactive 
10 molecule using recombinant DNA technology. 

The present invention particularly concerns a new method of producing 
multimeric fiision proteins involving the TNF superfamily (TNFSF) members as 
a fiision proteins with SPD, and more specifically, CD40L-SPD fiision proteins 
is and useful modifications thereof 



Description of Related Art 

Numerous proteins can be made using modem molecular biology 
techniques and used in diagnostic and therapeutic applications. Using 

20 recombinant DNA techniques, the DNA encoding a single amino acid chain is 
constructed and then introduced into a cell which manufactures the final protein. 
Some cells, especially bacteria like E. coh y lack the ability to properly fold the 
amino acid chains into the proper quaternary structure and they often fail to apply 
the necessary modifications (e.g., glycosylation and disulfide bond formation) that 

25 are needed for the protein to be bioactive and resistant to degradation in vivo. 
While most of these challenges can be met by expressing the amino acid chain in 
eukaryotic cells like yeast or mammalian cells in vitro, it is not always 
straightforward to express proteins that consist of two or more amino acid chains. 
In general, for multichain proteins, the single amino acid chains must associate 

30 together in some way either within the producer cell or subsequently after the 
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monomers are secreted from the producer cell. For artificially constructed 
molecules, the introduction into a single amino acid chain of an amino acid 
sequence which causes this chain-to-chain association can be an important step in 
producing multichain proteins. 

5 

One of the most widely used methods of causing two amino acid chains to 
associate is to conjoin, at the DNA coding level, segments from the protein of 
interest and a segment from a spontaneously dimerizing protein. The best 
example is to conjoin or fuse a protein with the Fc portion of immunoglobulin, 

10 creating a dimeric Fc fusion protein (Fanslow et al, J. Immunol 136:4099, 1 986). 
A protein of this type can be formed from the extracellular domain of a tumor 
necrosis factor receptor fused to Fc (termed etanercept and marketed as 
ENBREL®), which is effective in the treatment of rheumatoid arthritis. A second 
example is the construction of a fusion protein between the dimerizing 

is extracellular portion of CD8 with the extracellular portion of CD40L 

(Hollenbaughe/a/.,£MB<9</. 11:4313,1992). Here, the dimerizing CD8 portion 
of the fusion protein helps to maintain the CD40L portion in the trimeric form 
needed for its bioactivity. A more recent example is the addition of an isoleucine 
zipper motif to CD40L, which permits the production of trimeric soluble CD40L 

20 molecules (Morris et al, J. Biol Chem. 274:41 8, 1 999). 

The TNF superfamily (TNFSF) consists of an expanding number of 
proteins (see Table I) which are crucial for the development and functioning of 
the immune, hematological, and skeletal systems. TNFSF proteins are ligands for 
25 a corresponding set of receptors of the TNF receptor superfamily (TNFRSF). All 
TNFSF members are expressed as Type II membrane proteins, with the exception 
of lymphotoxin-alpha which is produced as a secreted protein. However, soluble 
forms of several TNFSF proteins can be released from the cell surface by 
proteolytic cleavage, usually by specific metalloproteinases. 



2 
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The production of soluble forms of TNFSF proteins has been an important 
step in the study of these proteins. Soluble TNFSF ligands can be used to study 
the activities of these proteins in vitro without the complexities in interpretation 
5 that result when cells or cellular membranes expressing TNFSF proteins are 
added. In addition, soluble forms of several TNFSF proteins have potential as 
therapeutic agents for human diseases. In particular, TNF-D has been extensively 
studied for the treatment of cancer and soluble CD40L is currently undergoing 
clinical trials to assess its antitumor effects. 

10 

To produce soluble forms of TNFSF proteins, either the membrane protein 
is expressed in a cell line possessing a protease capable of separating the TNFSF 
extracellular domain from the transmembrane domain or a truncated form of the 
TNFSF protein is produced which consists solely of the extracellular domain plus 

15 a signal sequence. In either case, certain soluble forms of TNFSF proteins are 
unstable in solution as simple homotrimers composed solely of the extracellular 
domain. For example, naturally solubilized TNF-D is labile under physiological 
conditions [Schuchmann, 1995 #129]. To solve this stability problem, chimeric 
proteins have been constructed according to one of four different design 

20 principles: (1) The extracellular portion of the TNFSF protein has been expressed 
fused to the dimeric portion of the immunoglobulin Fc fragment US Patent 
5,155,027, Oct. 13, 1992 Jssued to, Andrzej Z. Sledziewski, et al. In the case of 
CD40L and OX40L, this yields a soluble molecule which is significantly less 
active than the native membrane form of this protein. (2) The extracellular 

25 portion of the TNFSF protein has been expressed with an antigenic tag (usually 
the FLAG motif) fused to its N-terminus [Mariani, 1996 ]. The addition of an 
antibody to the tag (e.g., anti-FLAG antibody) aggregates these proteins into a 
multimeric form. Crosslinking enhances activity on B cells. (3) The extracellular 
portion of the TNFSF protein has been expressed fused to the spontaneously 

3 
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dimerizing extracellular portion of the CD8 molecule [Hollenbaugh, 1992 ]. In the 
case of CD40L, this creates a hexameric molecule [Pullen, 1 999 ] which is likely 
formed by two CD40L trimers attached to three CD8 dimeric stalks. Despite this, 
the addition of an anti-CD8 antibody to crosslink the CD40L-CD8 fusion protein 
yields a further enhancement of CD40L activity on B cells. (4) The extracellular 
portion of the TNFSF protein has been expressed fused to a trimerizing isoleucine 
zipper which maintains the overall trimeric structure of the protein [US 
Patent5,716,805, Feb. 10, 1998, issued to Subashini Srinivasan et al. This soluble 
CD40L trimer or 4 sCD40LT is the form of that protein now being clinically 
tested in humans for its anti-tumor effects. 

Compounding the difficulties in producing stable forms of soluble TNFSF 
proteins are compromises in bioactivity. As exemplified by FasL, TNF, and 
CD40L, many of the soluble forms of these proteins lack the full range of 
stimulatory activities displayed by the membrane forms of these molecules. For 
FasL, several groups have reported that naturally produced soluble FasL 
(generated by proteolytic cleavage from the membrane form) has a spectrum of 
activities that is distinctly different from the membrane form. Soluble FasL 
induces apoptosis in activated CD4+ T cells but not fresh, resting CD4+ T cells. 
In contrast, both types of CD4+ T cells are killed by membrane FasL or a 
recombinant soluble form of FasL (WX1) that spontaneously aggregates into 
oligomers larger than a decamer. For TNF, T cell activation through stimulation 
of TNFR II, the 80 kDa receptor for TNF, is much greater with membrane TNF 
than soluble TNF. However, if soluble TNF is produced as a tagged protein and 
crosslinked with an antibody against the tag, then it completely mimics the 
activities of membrane TNF [Schneider, 1998]. Finally, for CD40L, the 
stimulatory effects of a soluble form of this TNFSF protein are enhanced by 
crosslinking [Kehry, 1994] and yields an activity similar to membrane CD40L. 
For example, soluble CD40L-CD8 fusion protein requires crosslinking with a 
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antibody to CD8 in order to drive resting B cells to proliferate to a degree similar 
to membrane-bound CD40L. ). Even more strikingly, although membrane-bound 
CD40L expressed on baculovirus-transduced SF9 insect cells is a strong B cell 
stimulus, small vesicles (10 - 1,500 nm) prepared from the membranes of these 
5 cells are less stimulatory. However, ultracentrifiigation of these vesicles creates 
aggregates which have the full activity of the original membrane CD40L protein. 
This indicates that B cells are more highly stimulated by a large surface of CD40L 
than they are by a smaller surface expressing this membrane ligand. 



10 Taken together, the above reports suggest that, for some TNFSF/TNFRSF 

ligand/receptor pairs at least, it is essential to cluster receptors together for full 
signaling activity. By this interpretation, the efficacy of the membrane forms of 
FasL, TNF, and CD40L occurs because these ligands can move in the plane of the 
membrane toward the contact zone with a receptor-bearing responding cell, 

15 thereby clustering ligated receptors to form a receptor-dense region of the 
membrane. This interpretation is further supported by experiments where 
crosslinking of a soluble TNFSF protein effectively mimics the activity of the 
membrane form of the protein [Scheider, 1998]. 

20 In all of the above examples, no more than three amino acid chains have 

been caused to associate together. There is a need to produce multimeric protein 
molecules where more than three amino acid chains are caused to associate into a 
single soluble molecular complex. An important example comes from studies of 
CD40L (also called CD 154 or TNFSF5), which is a member of the TNF family of 

25 molecules that are normally expressed as insoluble, cell membrane proteins. It 
has been shown that soluble homotrimers composed of the extracellular regions of 
CD40L, TNF, and FasL are not potently active on resting cells that bear receptors 
for these proteins. However, if these proteins are expressed with a tag on their 
ends (e.g., the FLAG peptide sequence) and then the trimers are extensively 
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crosslinked using an antibody to FLAG, full activity appears (Schneider et al 9 J. 
Exp. Med 1 87: 1205, 1 998). From this, it can be inferred that the soluble single- 
trimer forms of these molecules does not duplicate the multivalent interactions 
that normally occur when a receptor-bearing cell comes in contact with the 
5 membrane of a cell expressing numerous ligand trimers on its surface. This 

distinction may be due to a need for receptor clustering for full signaling (Bazzoni 
and Beutler, K Engl J. Med. 334: 171 7, 1996), which in turn is only possible with 
a multimeric ligand engaging many receptors at the same time in a localized 
region of the cell membrane. 

10 

SUMMARY OF THE INVENTION 



The present invention contemplates a method of preparing soluble, 
multimeric mammalian proteins by culturing a host cell transformed or transfected 
15 with an expression vector encoding a fusion protein comprising the hub, body, 
and neck region of a collectin molecule and a heterologous mammalian protein. 

In one embodiment, the heterologous mammalian protein comprises an 
extra cellular domain of a mammalian transmembrane protein; the resulting fusion 
20 protein forms a multimer. 



In another embodiment, the heterologous mammalian protein comprises a 
soluble protein such as a cytokine; the resulting fusion protein forms a multimer. 

25 In another embodiment, sites of proteolytic degradation are included or 

removed from the fusion protein; the resulting fusion protein forms a multimer 
from which are cleaved single units at a rate made variable by the nature of the 
proteolytic digestion sites either included or excluded. 
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In yet another embodiment, special attention is given to the 
immunogenicity of the fusion protein by altering the junction between the two 
naturally occurring proteins from which it is made; the resulting fusion protein 
may be less or more able to elicit an immune response against itself, which could 
lengthen its persistence or contribute to it immunological effectiveness. 

A hybrid nucleotide sequence of no more than 1528 base pairs including a 
sequence defining a structural gene expressing a conjoined single strand of a 
multimeric TNFSF-SPD fusion protein, said structural gene having a nucleotide 
base sequence selected from members of the group consisting of SEQ ID NO 1, 
SEQ ID NO 3 and SEQ ID NO 5 is disclosed by this invention. In one 
embodiment, the DNA segment the structural gene has a sequence expressing a 
single hybrid amino acid chain of TNFSF-SPD, the segment having a first SPD 
nucleotide base sequence of SEQ ID NO 1 , from base 32 to base 799, and a 
second sequence, expressing a portion of TNFSF stalk, selected from members of 
the group consisting of SEQ ID NO 1, from base 800 to base 1444, SEQ ID NO 3, 
from base 800 to base 1528, and SEQ ID NO 5, from base 800 to base 1441. 

In another embodiment, a recombinant DNA molecule has vector 
operatively linked to an exogenous DNA segment defining a structural gene 
expressing a single amino acid chain of TNFSF-SPD. This structural gene has a 
nucleotide base sequence selected from members of the group consisting of SEQ 
ID NO 1 , SEQ ID NO 3 and SEQ ID NO 5, any functional equivalents and 
modifications thereof There is also attached an appropriate promoter for driving 
the expression of said structural gene in a compatible host organism. The 
organism can be E. coli, a yeast, a higher plant or animal. 

Yet another embodiment contemplated by the invention is multimeric 
TNFSF-SPD fusion protein having a plurality of polypeptide trimers, a first 
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trimer consisting of peptide strands of members of the TNF superfamily (TNFSF) 
of ligands, and a second trimer strand from a collectin molecule, each first trimer 
conjoined to a second polypeptide trimer strand from a collectin molecule, 
wherein said ligand strand is substituted for native carbohydrate recognition 
domains (CRD) of the collectin molecules. The conjoined collectin strands are 
covalently bound in parallel to each other, forming a multimeric fusion protein 
comprising a plurality of trimeric hybrid polypeptide strands radiating from a 
covalently bound center hub of the molecule. The free end of each trimeric 
radiating strand has a TNFSF moiety attached. The TNFSF moiety is one 
selected from the group consisting of ligands LTA, TNF, LTB, and TNFSF4 to 
TNFSF 18 as shown in Table II, and their funtional equivalents, and modifications 
thereof 

The invention also contemplates a method for preparing a CD40-SPD 
multimeric fusion polypeptide, including the steps of initiating a culture, in a 
nutrient medium, of procaryotic or eucatryotic host cells transformed with a 
recombinant DNA molecule including an expression vector, appropriate for the 
cells, operatively linked to an exogenous DNA segment defining a structural gene 
for CD40-SPD ligand. The structural gene has a nucleotide base sequence of SEQ 
ID NO 1 from about base 32 to about base 1444. Thereafter, the culture is 
maintained for a time period sufficient for the cells to express the multimeric 
molecule. 

Also contemplated is a method of producing a secreted, very large, 
biologically active, multimeric tumor necrosis factor superfamily ligand fusion 
protein chimera that is highly immunogenic and not readily difiusable. The steps 
for this method are as follows: 

1 . introducing into a host cell a first chimeric DNA construct including a 
transcriptional promoter operatively linked to a first secretory signal sequence, 

8 
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followed downstream by, and in proper reading frame with a first DNA sequence 
encoding a polypeptide chain of a first TNFSF ligand requiring multimerization 
for biological activity. This sequence is joined to a second DNA sequence 
encoding a collectin polypeptide at the site where the collectings CRD was 
5 purposefully removed. 

2. introducing into the host cell, a second DNA construct including a 
transcriptional promoter operably linked to a second secretory signal sequence 
followed downstream by, and in proper reading frame with, a third DNA sequence 
encoding a second polypeptide chain of a second TNFSF ligand Joined to a 

10 fourth DNA sequence encoding a collectin polypeptide , wherein the collectings 
CRD was purposefully removed, and then, 

3. growing the host cell in an appropriate growth medium under 
physiological conditions to allow the secretion of a large multimerized 
polypeptide fusion protein, wherein the first polypeptide chain of a TNFSF-SPD 

15 protein is bound by parallel bonding of the respective collectin domain trimer to 
the second polypeptide chain of a different TNFSF-SPD polypeptide trimer, and 
wherein the multimerized polypeptide fusion protein exhibits biological activity 
characteristic of both membrane-attached TNFSFs, and 

4. isolating the biologically active, multimerized TNFSF-SPD polypeptide 
20 fusion from said host cell. The chimeric reactant compounds are humanized to 

guard against destruction by a potential human recipient's immune system, 

A final method of preparing a multimeric TNFSF-SPD ligand fusion 
protein contemplated requires a) preparing a first DNA segment coding for a 
25 strand of an exposed extracellular portion of TNFSF; b) preparing a second DNA 
segment coding for a collectin polypeptide strand, wherein the collectings CRD 
domain of the strand has been removed; c) conjoining the first and second DNAs 
in proper reading frame, thereby creating a TNFSF-collectin DNA construct; d) 
inserting the construct into an expression vector system; e) introducing the vector 
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system into an appropriate cell in culture under suitable conditions; f) harvesting 
and purifying spent medium from the culture; and finally g) assaying for presence 
of multimeric TNFSF-collectin fusion protein. 

A method for stimulating the immune response in potentially 
immonocompetent cells using multimeric TNFSF fusion proteins by contacting 
the cells with the multimeric TNFSF fusion proteins, causing the cells to 
proliferate, is also contemplated. The cells used may be resting B cells. There is 
also a method for increasing antigenicity of cells by contacting the cells with the 
multimeric TNFSF fusion proteins. In this case, the cells may be tumor cells or 
HTV positive cells. 

Other preferred embodiments contemplate the methods of preparation 
described above, wherein the host transformed is either a prokaryote, such as R 
coli, a eukaryote, for example yeast, such as S. cerevisiae . or a higher plant, such 
as alfalfa or tobacco. 

Still further embodiments and advantages of the invention will become 
apparent to those skilled in the art upon reading the entire disclosure contained 
herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Structure of the CD40L-SPD fusion protein. The extracellular 
portion of the CD40L homotrimer, including its membrane-proximal stalk, was 
fused to the body of SPD. The N-terminus of SPD contains two cysteines which 
link the homopolymer together by disulfide bonds forming a hub. The trimeric 
collagenous stalk extend from the hub as a cruciate structure and end in a 
spontaneously trimerizing neck region. The amino acid domains in a single chain 

10 
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of the CD40L-SPD are shown at the top. At the bottom is the tetrameric (four 
CD40L trimers) which is expected to form. In addition, the hub region of SPD 
can participate in stacking up to 8 or more cruciate forms into higher order 
aggregates. 

5 

Figure 2. Ion-exchange chromatography of murine CD40L-SPD. CHO cells 
expressing murine CD40L-SPD were grown in serum-free media, concentrated 
using a 100 kDa cutoff ultrafiltration membrane, and diafiltered into 50 mM 
bicine, pH 9.0, 1 mM EDTA. Using an FPLC system, the protein from 400 mL of 
10 media was applied to a Fractogel SCV 650M column and eluted with a linear salt 
gradient. 3 mL samples were collected. Shown are curves for protein 
concentration (OD280X conductivity as % 1 M NaCl in the buffer, and ELISA- 
detectable CD40L-SPD assayed at 1 : 100 dilution. 

15 Figure 3* Size fractionation of murine CD40L-SPD by ultrafiltration. 

CD40L-SPD is a 471 amino acid protein with a predicted molecular weight of 
49,012 for each of the twelve component chains in the dodecamer (composed of 
four trimeric subunits). This does not include added carbohydrates. Therefore, 
the full dodecamer will have a molecular weight in excess of 600,000. However, 

20 from the literature on recombinant surfactant protein D made in CHO cells, it 
appears that some of the product will be in the form of trimers that are not part of 
a cruciate-formed dodecamer. To determine what percentage of CD40L-SPD was 
produced in a multimeric form, supernatant from the transfected CHO cells were 
passed through filters of different porosities (rated for their ability to retard 

25 globular proteins). An ELISA was used to detect the amount of CD40L-SPD 
(measured at multiple dilutions) that passed through the filter. As shown, about 
90% of the protein is retained by a 300,000 kDa cut-off filter. This indicates that 
most of the protein is in the dodecameric form. In addition, the cruciate 
dodecamers of surfactant protein D can also stack on top of each other into even 

11 



WO 01/42298 PCT/US00/07380 

higher molecular weight forms. This is the likely explanation for the small 
fraction of CD40L-SPD that is retained by the 1,000 kDa cut-off filter. 



Figure 4. Activation of human B cells by human CD40L-SPD. Conditioned 
media from CHO cells expressing human CD40L-SPD was added to human B 
cells along with IL-4. In the left panel, the cells were stained with CyChrome- 
labeled anti-CD19 to identify B cells and PE-labeled anti-CD3 to identify T cells. 
As shown, most of the cells proliferating in the culture were CD 19+ CD3- B cells. 
In the right panel, the cells were stained with CyChrome-labeled anti-CD 19 to 
identify B cells and PE-labeled anti-CD80 (B7-1) to identify this co-stimulatory 
molecule. As shown, almost all of the B cells were induced by CD40L-SPD to 
express CD80. 

Figure 5. Activation of murine B cells by murine CD40L-SPD. Murine 
15 CD40L-SPD was added to resting murine splenic B cells for a two day culture 
period. For the final 4 hours, the cultures were pulsed with 3 H-thymidine, 
following which the cells were harvested and DNA synthesis was measured by 
scintillation counting. As shown, CD40L-SPD is nearly as effective as anti-IgM 
in promoting the proliferation of resting B cells. 

20 

Figure 6. CD40L-SPD stimulation of macrophage chemokine production. 

Conditioned media from CHO cells expressing human CD40L-SPD, an inactive 
mutant of human CD40L-SPD (T 1 47N-CD40L-SPD), or murine CD40L-SPD 
(mCD40L-SPD) were added to cultures of human monocyte-derived 
25 macrophages. As a negative control, this media was heat-inactivated at 60 °C for 
30 minutes. Also shown is a form of soluble CD40L (sCD40L) consisting of 149 
amino acids from the extracellular domain of human CD40L (Peprotech) added at 
1 Dg/mL. 24 hours later, supernatants were collected and assay for MIP-1 □ by 
ELISA (R&D Systems). The weak activity of soluble single-trimer CD40L 

12 
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(sCD40L) is apparent. In contrast, native human and murine CD40L-SPD 
strongly activated the macrophages to produce MIP-1 □. In contrast, heat- 
inactivated CD40L-SPD was inactive. As expected, the inactive mutant, T147N- 
CD40L-SPD, also failed to stimulate macrophages, demonstrating that the CD40L 
portion and not the SPD portion of the protein was responsible for stimulating the 
macrophages. 

Figure 7. Expression of RANKL/TRANCE-SPD production from CHO cells 
detected by ELISA. Antibodies against RANKL/TRANCE were used to 
construct an ELISA capable of detecting the RANKL/TRANCE protein. As 
shown, there was no background with the media control. Using a fusion protein 
between CD70 (CD27L or TNFSF7) and SPD, there was also no signal, indicating 
the specificity of the ELISA. However, using CHO cells transfected with an 
expression plasmid for CD70-SPD, immunoreactive secreted protein was clearly 
detectable. This demonstrates the generalizability of the method for expressing 
TNFSF members as fusion proteins with collectins such as SPD. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
1. Definition of Terms 

Multimeric: As used herein the term multimeric refers to a multimer of a 
polypeptide that is itself a trimer (i.e., a plurality of trimers). 

Functional Equivalent: Herein refers to a sequence of a peptide or 
polypeptide that has substantial structural similarity and functional similarity to 
another such sequence. 

Modifications: Herein refers to point changes involving single amino 
acids, wherein the functionality is altered, without appreciably altering the 
primary sequence or primary structure of a peptide or polypeptide. 

Amino Acid: All amino acid residues identified herein are in the natural L- 
configuration. In keeping with standard polypeptide nomenclature, J. Biol. 

13 
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Chenu 243:3557-59, (1969), abbreviations for amino acid residues are as shown 
in the following Table of Correspondence: 



TABLE OF CORRESPONDENCE 



10 




OA/Ik jTT>/-\T 

SYMBOL 


AMINO ACID 




1 -Letter 


3-Letter 






Y 


Tyr 


L-tyrosine 




G 


Gly 


glycine 


15 


F 


Phe 


L-phenylalanine 




M 


X £ a. 

Met 


L-methionine 




A 


Ala 


L-alanine 




S 


Ser 


L-serine 




L 


He 


L-isoleucine 


20 


L 


Leu 


L-leucine 




T 


Thr 


L-threonine 




V 


Val 


L-valine 




P 


Pro 


L-proline 




K 


Lys 


L-lysine 


25 


H 


His 


L-histidine 




Q 


Gin 


L-glutamine 




E 


Glu 


L-glutamic acid 




W 


Trp 


L-tryptophan 




R 


Arg 


L-arginine 


30 


D 


Asp 


L-aspartic acid 




N 


Asn 


L-asparagin 




C 


Cys 


L-cysteine 



It should be noted that all amino acid residue sequences are represented herein by 
35 formulae whose left to right orientation is in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a bond to a radical 
such as H and OH (hydrogen and hydroxyl) at the amino- and carboxy-termini, 
respectively, or a further sequence of one or more amino acid residues up to a 



14 



WO 01/42298 PCT/USOO/07380 

total of about fifty residues in the polypeptide chain. 

Base Pair (bp): A partnership of adenine (A) with thymine (T), or of 
cytosine (C) with guanine (G) in a double stranded DNA molecule. 

Constitutive promoter: A promoter where the rate of RNA polymerase 
binding and initiation is approximately constant and relatively independent of 
external stimuli. Examples of constitutive promoters include the cauliflower 
mosaic virus 35S and 19S promoters described by Poszkowski et al., EMBOJ., 
3:2719 (1989) and Odell et al, Nature, 313:810 (1985). 
DNA: Desoxyribonucleic acid. 

Enzyme: A protein, polypeptide, peptide RNA molecule, or multimeric 
protein capable of accelerating or producing by catalytic action some change in a 
substrate for which it is often specific. 

Expression vector: A DNA sequence that forms control elements that 
regulate expression of structural genes when operatively linked to those genes. 

Expression: The combination of intracellular processes, including 
transcription and translation undergone by a structural gene to produce a 
polypeptide. 

Insert: A DNA sequence foreign to the rDNA, consisting of a structural 
gene and optionally additional DNA sequences. 

Nucleotide: A monomeric unit of DNA or RNA consisting of a sugar 
moiety (pentose), a phosphate, and a nitrogenous heterocyclic base. The base is 
linked to the sugar moiety via the glycosidic carbon (1 9 carbon of the pentose) and 
that combination of base and sugar is a nucleoside. When the nucleoside contains 
a phosphate group bonded to the 3 r or 5' position of the pentose it is referred to as 
a nucleotide. 

Operatively linked or inserted: A structural gene is covalently bonded in 
correct reading frame to another DNA (or RNA as appropriate) segment, such as 
to an expression vector so that the structural gene is under the control of the 
expression vector. 
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Polypeptide and peptide: A linear series of amino acid residues 
connected one to the other by peptide bonds between the alpha-amino and 
carboxy groups of adjacent residues. 

Promoter: A recognition site on a DNA sequence or group of DNA 
sequences that provide an expression control element for a gene and to which 
RNA polymerase specifically binds and initiates RNA synthesis (transcription) of 
that gene. 

Inducible promoter: A promoter where the rate of RNA polymerase 
binding and initiation is modulated by external stimuli. Such stimuli include light, 
heat, anaerobic stress, alteration in nutrient conditions, presence or absence of a 
metabolite, presence of a ligand, microbial attack, wounding and the like. 

Spatially regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated in a specific structure of the 
organism such as the leaf, stem or root. Examples of spatially regulated 
promoters are given in Chua et al., Science, 244: 1 74- 1 8 1 ( 1 989). 

Spatiotemporally regulated promoter: A promoter where the rate of 
RNA polymerase binding and initiation is modulated in a specific structure of the 
organism at a specific time during development. A typical spatiotemporally 
regulated promoter is the EPSP synthase-35S promoter described by Chua et al., 
Science, 244:174-181 (1989). 

Temporally regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated at a specific time during 
development. Examples of temporally regulated promoters are given in Chua et 
al., Science, 244:174-181 (1989). 

Protein: A linear series of greater than about 50 amino acid residues 
connected one to the other as in a polypeptide. 

Recombinant DNA molecule: A hybrid DNA sequence comprising at 
least two nucleotide sequences not normally found together in nature. 
RNA: Ribonucleic acid. 
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Selective Genetic marker: A DNA sequence coding for a phenotypical 
trait by means of which transformed cells can be selected from untransformed 
cells. 

Structural gene: A DNA sequence that is expressed as a polypeptide, i.e., 
an amino acid residue sequence. 

Synthetic promoter: A promoter that was chemically synthesized rather 
than biologically derived. Usually synthetic promoters incorporate sequence 
changes that optimize the efficiency of RNA polymerase initiation. 

2. Introduction 

This invention discloses the production of TNFSF proteins as multimeric 
(i.e., many trimers) ligands fused onto a trimeric, branched protein backbone. 
Collectin molecules are ideal for this purpose because they are formed from many 
trimeric, collagenous arms linked to a central hub by disulfide bonds. Of the 
collectins, pulmonary surfactant protein D (SPD) was chosen initially because it is 
a homopolymer encoded by a single gene, unlike Clq and surfactant protein A, 
which are composed of two different protein subunits. In addition, recombinant 
SPD has been successfully expressed in vitro in reasonable yield [Crouch, 1994], 
and a peptide containing the "neck" region of SPD was shown to spontaneously 
trimerize in solution [Hoppe, 1994]. Consequently, extracellular domains of 
human and murine CD40L were substituted for the carbohydrate recognition 
domain of pulmonary surfactant D (SPD) to create a four-armed molecule (three 
peptide chains per arm) with CD40L at the end of each arm. This molecule is 
named CD40L-SPD. In addition, because SPD tends to stack into higher order 
aggregates with up to 8 molecules associated at the hub [Crouch], even greater 
degree of multimerization can occur [Lu, 1993]. CD40L-SPD therefore mimics 
the expression of CD40L by an activated T cell in that it presents a multivalent 
complex similar to membrane-bound CD40L. While remaining soluble, CD40L- 
SPD equals membrane CD40L in its range of activities. 
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3. Construction of expression plasmids for CD40L-SPD . 

cDNAs of exposed human and murine CD40L, removed from cell 
membranes, were cloned by PCR by well-known methods. Murine surfactant 
protein D was cloned by hemi-nested PCR from murine lung mRNA (Clonetech). 
cDNA was prepared using Superscript II reverse transcriptase (Life Technologies, 
Gaithersburg, MD) and random hexamers as primers. PCR primer sequences 
were as follows (the underlined bases indicate restriction endonuclease sites for 
cloning into the vector): 



mSPD5: 5'-CTGACATGCTGCCCT^TCTCTCCATGC-3 , 

mSPD3ext: 5 , -GGAGGCCAGCTGTCCTCCAGCCTGTTTGC-3 , 

rmSPDS: 5'GGGGICTAGCGAATTCCACCAG GAAGCAATCTGAC ATGCTGCCCTTTCTCT 

CCATGC-3' 

CD40L/SPD3: S'-CTATCTrGTCCAACCTTCTATG/GCCATCAGGGAACAATGCAGCTTTCO' 

SPD/CD40L5; 5-AAAGCTGCATTGTTCCCTGATGGC/CATAGAAGGTTGGACA AGATAGAAG-3 1 

CD40L3 : 5-GGGCTCGAGGTA^ AGTTCTAC ATGCCTTGGAGTGTAT AAT-3 ' 

SPD/mCD40L5: S'-GAAAGCTGCATTGTTCCCTGATGGC/CATAGAAGATTGGATAAGGTCGAAG-S' 

mCD40L/SPD3 : 5 , -CTTCGACCTTATCCAATCTTCTATG/GCCATCAGGGAACAATGCAGCTTTC-3' 

mCD40L3: 5'-GGGGGGTACCCTGCTGCAGCCTAGGACAGCGCAC-3' 

Because the murine SPD sequence of the 5 r untranslated region containing 
the ribosomal binding site was unknown when this work was started [Motwani, 
1 995], a primer (rmSPDS) was designed based on the available rat sequence 
[Shimizu, 1992] which extended the 5' end with rat sequence (shown in bold) 
along with an added Nhe I site (underlined). 

4 Creation of the CD40L-SPD Fusions 

To create the CD40L-SPD fusions, overlap PCR was used. Murine SPD 
was amplified by nested PCR using mSPD5 and mSPD3ext for the first round of 
30 cycles. The product was diluted 1 .1,000 and 1 DL was amplified for another 
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30 cycles using rmSPD5 and CD40L/SPD3, where the 3 1 half of CD40L/SPD3 is 
a reverse primer for SPD C-terminal to the neck region (deleting the CRD) and 
the 5' half of CD40L/SPD3 contains bases from the N-terminus of the 
extracellular portion of CD40L (immediately adjacent to the transmembrane 
5 region). Similarly, the CD40L plasmid was amplified with SPD/CD40L5 and 
CD40L3, which contains a Kpn I site (underlined). All of these PCRs were 
performed with Pfu cloned polymerase (Stratagene,) using hot start (Ampliwax, 
Perkin-Elmer) and the thermocycling program: 94 °C for 2.5 min; then 30 cycles 
of 94 °C for 10 sec, 43 °C for 30 sec, and 75 °C for 7 min. 

10 

To form the chimeric construct, 1 \xLofa 1:1,000 dilution of gel-purified 
products from the above reactions was combined and amplified with rmSPD5 and 
CD40L3. Because Pfu polymerase did not consistently yield the expected 1 .62 kb 
overlap product, AccuTaq LA DNA polymerase (Sigma) was used for this PCR, 

15 using the thermocycling program: 94 °C for 2.5 min; then 30 cycles of 98 °C for 
20 sec, 43 °C for 30 sec, and 68 °C for 10 min. The resulting product was 
digested with Nhe I and Kpn I, gel-purified, and ligated into the Nhe I and Kpn I 
sites in the expression plasmid, pcDNA3.1(+) (Invitrogen, Carlsbad, CA). DH5 
E. coli were transformed with the construct and plasmid DNA was purified either 

20 by double banding in ethidium bromide-CsCl gradients or by anion exchange 
resin (QIAgen). To form the T147N-CD40L-SPD construct, the same approach 
was used except that the CD40L coding region was taken from the expression 
plasmid for T147N-CD40L [Kornbluth]. The amino acid sequence at the junction 
between SPD and CD40L is . . . KA ALFPDG/HRRLDKIE . . . , where the C-terminal 

25 portion begins the sequence for CD40L. To form mCD40L-SPD, a similar 
approach was taken except that primers SPD/mCD40L5, mCD40L/SPD3, and 
mCD40L3 were used for amplifications involving murine CD40L sequences. The 
amino acid sequence at the junction between SPD and murine CD40L is 
. .Kj\ALFPIXj/HRRLDKVE . where the C-terminal portion begins the sequence 
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for murine CD40L. Both DNA strands of each construct were sequenced to 
confirm that the constructs were correct. In other experiments, an entirely 
humanized construct, consisting of human CD40L fused to human SPD, was 
constructed (data not shown). 



5. Construction of expression plasmid for murine RANKL/TRANCE 
(TNFSF11). 

Spleen cells from C3H/HeJ mice were stimulated with 5 Dg/ml 
concanavalin A and 10 ng/ml IL-2 (Sigma) for 8 hours (31). mRNA was isolated 
using the Micro FastTrack kit (Invitrogen). cDNA was prepared using 
Superscript II reverse transcriptase (Life Technologies) and random hexamers as 
primers. PCR primer sequences were as follows (where the underlined bases 
indicate restriction endonuclease sites for cloning into the vector): 

5mRANKL-ext: S'-CATGTTCCTGGCCCTCCTCO' 

3 mRANKL-ext: 5'-GT AC AGGCTC AAG AGAG AGGGC-3' 

5mRANKL-im: 5-ATACTCGAGCGCAGATGGATCCTAAC-3' 

3mRANKL-int: 5'-GGGGTTTA GCGGCCGC TAATGTTCC ACQ A A ATG AGTTP-T 

The extracellular portion of RANKL/TRANCE was cloned by nested PCR. 
In the first round of PCR, 5mRANKL-ext and 3mRANKL-ext were used with Pfu 
cloned polymerase (Stragene) using the thermocycling program: 94 °C for 2.5 
min; then 30 cycles of 94 °C for 10 sec, 50 °C for 30 sec, and 75 °C for 2 min. 
The product was diluted 1 : 1 ,000 and 1 DL was amplified for another 30 cycles 
using 5mRANKL-int and 3mRANK-int, which contain an Xho I site and a Not I 
site respectively. The resulting product was digested with Xho I, blunt-ended 
with T4 DNA polymerase, then digested with Not 1 and gel-purified. The 
CD40L-SPD expression plasmid described above was digested with Msc I and 
Not 1 and gel purified. Then the RANKL/TRANCE sequence was ligated into 
this vector in frame with the SPD coding sequence. The amino acid sequence at 

20 



WO 01/42298 PCT/US00/07380 

the junction between SPD and RANKL/TRANCE is 

. . .KAALFPDG/RAQMDPNR. . . , where the N-terminal portion is from SPD and 
the C-terminal portion is the extracellular sequence of RANKL/TRANCE. Both 
DNA strands of each construct were sequenced to confirm that the constructs 
were correct. 

6. Stable transfection of DHFK-deficient CHO cells and amplification, 

DG44 (a line of CHO-K1 cells deficient in dihydrofolate reductase (DHFR)) (32) 
and pCHIP (a plasmid containing the hamster DHFR minigene) (33) were gifts from Dr. 
Lawrence Chasin, Columbia University, New York, NY . DG44 cells were cultured in 
□-MEM consisting of ribo- and deoxynucleoside-free D-MEM (BioWhittaker, 
Walkersville, MD) supplemented with 200 DM L-glutamine, 10% fetal bovine serum 
(FBS) and 10 Dg/ml each of adenosine, deoxyadenosine, and thymidine (Sigma). All 
cell cultures described were negative in a mycoplasma rRNA assay (Gen-Probe, San 
Diego). DG44 cells in six-well plates were transfected by the method of Okayama and 
Chen ((34) with 10 Dg of expression plasmid and 0.05 Dg of pCHIP (200:1 ratio). 
After two days, the transfected DG44 were trypsinized and transferred to 100 mm plates. 
At this point, the media was switched to D -MEM which differs from D-MEM in that 
dialyzed FBS (HyClone Systems, Logan, UT) was used and no nucleoside supplements 
were added. Only cells containing the DHFR minigene were able to grow in D -MEM, 
and colonies were selected after 10 days, cloned using cloning rings, and transferred to 
12.5 cm 2 flasks. Clones were selected for expansion using an ELISA to screen for the 
production of either murine or human CD40L (see below). Using the method described 
by Kingston et al (35), escalating doses of methotrexate were used to amplify the 
transfected genes over a period of 6-14 months until the cells grew well in 80 DM 
methotrexate. Each expressing clone was re-cloned once or twice more in order to select 
the highest expressing cells. 

7. Preparation of human and murine CD40L-SPD in serum-free media. 

Selected clones were adapted for growth in nucleoside-free UltraCHO 

media (BioWhittaker) supplemented with 50-100 pg/mL ascorbic acid and 50 pM 
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methotrexate (Sigma). The non-adherent population was further adapted for 
suspension growth in roller bottles. In some experiments, the cells were adapted 
from D'-MEM to CHO-S-SFM II media (Life Technologies) supplemented with 
ascorbic acid and 50 ng/mL L-proline. 



8. ELISA assay for human and murine CD40L-SPD . 

To assay for correctly folded CD40L, wells of a MaxiSorb 96-well plate 
(Nunc) were coated overnight at 4 °C with 50 DL of carbonate-bicarbonate, pH 
9.40 buffer containing 0.5 \ig/mL 24-31 anti-human CD40L MAb (Ancell) or 
MR1 anti-murine MAb (Bioexpress, Lebanon, NH). Wells were blocked with 3% 
bovine serum albumin (BSA) in PBS. 1 00 \iL samples were added to the wells 
either neat or diluted in a dilution buffer consisting of 1% BSA, 0.9 % NaCl, 50 
mM Tris pH 7.40, and 0. 1 % peroxide-free Tween 20 (Sigma). After shaking for 2 
h at 600 RPM, a plate washer was used to wash the plate four times with 0.9 % 
NaCl, 50 mM Tris pH 7.40, and 0. 1 % peroxide-free Tween 20. Then, 100 \iL of 
diluent buffer containing 1 ng/mL biotinylated 24-3 1 anti-human CD40L Mab 
(Ancell) or MR1 anti -murine CD40L Mab (Pharmingen, San Diego, CA) was 
added to each well and again shaken for 2 h. Following another four washes, 100 
(iL of diluent buffer containing 1 |ig/mL of streptavidin-alkaline phosphatase 
(Jackson) was added to each well and the plate was shaken for 1 hour. Lastly, 
after another four washes, color was developed for 1 0-20 min using 1 00 ^iL/well 
of BluePhos (Kierkegaard & Perry), stop solution was added, and the wells were 
read at 650 ^m in a plate reader. 

9. Purification of human and murine CD40L-SPD. 

Conditioned UltraCHO media was filtered using a 0.2 p PES filter unit 
(Nalgene) and stored at 4 °C for up to 3 months. A preliminary size fractionation 
was performed by ultrafiltration through a 100 kDa-cutoff 76 mm membrane 
(YM-100, Millipore) in a 400 mL stirred cell at 10 Ibs/sq. inch pressure of argon. 
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Media was concentrated to about 10 mL, diluted to 100 mL with buffer, and again 
concentrated to 10 mL for a total of 3 cycles of ultrafiltration and buffer 
exchange. Buffer was 50 mM Bicine (Calbiochem), adjusted to pH 9.0 with 
NaOH (about 32 mM Na), and 1 mM EDTA to prevent the activity of any 

5 metalloproteinase. Using FPLC equipment (Amersham-Pharmacia), the 

concentrate was filtered through a 0.45 \i filter, placed into a 10 mL superloop, 
applied to a 10 X 30 mm column (HR10/30, Amersham-Pharmacia) packed with 
Fractogel SOf 650M (EM Biosciences), and eluted at 0.5 mL/min at 4 °C with a 
linear gradient of 0-500 mM NaCl in buffer. As described by the manufacturer, 

10 the resolution of proteins on Fractogel S0 3 " is enhanced by using a long, thin 
column geometry. Fractions were collected and screened for human or murine 
CD40L by ELISA. Positive fractions were pooled, concentrated by ultrafiltration 
(CentriPrep-30, Millipore), filtered through a 0.45 \x filter, and applied to a 
Superose 6 column (Amersham-Pharmacia) in phosphate-buffered saline. 

15 

10. Murine B cell cultures. 

C3H/HeJ mice were euthanized by C0 2 inhalation under a protocol 
approved by the Animal Subjects Committee of the San Diego VA Healthcare 
System. Splenocytes were isolated by centrifugation over Lympholyte-M 

20 (Accurate Chemical & Scientific Corp.. Westburv. NY ) and B cells were isolated 
by negative selection using anti-CD43 immunomagnetic beads (Miltenyi Biotec 
Inc.. Auburn, CA ). The resting B cells were suspended in Dulbecco's MEM with 
10% FBS at a concentration of 1 X 10 6 /mL, and 100 ^L was added to the wells of 
96-weIl flat-bottomed plates. 100 ^iL of dilutions of murine CD40L-SPD in 

25 media or media alone were added to the wells, which were incubated in 8.5% C0 2 
at 37 °C for 48 hours. Then, 0.5 ^Ci/well of 3 H-thymidine was added to each 
well, and the cells were collected 4 h later onto glass fiber filters using an 
automated cell harvester. A scintillation counter was used to determine the 
incorporated radioactivity. 
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11. Human B cell cultures. 

Venous blood from consenting subjects was used as a source of human B 
cells under a protocol approved by the UCSD Institutional Review Board. Blood 
was collected into syringes containing 5 U/mL heparin and peripheral blood 
5 mononuclear cells (PBMC) were isolated by centrifugation over Ficoll-hypaque. 
The cells were suspended at 2 X 10 5 /mL in RPMI 1640 containing 200 L- 
glutamine, 10% FBS, 0.832 fiM cyclosporin A (Sigma), and 25 ng/mL human IL- 
4 (R & D Systems) and incubated in 5% C0 2 at 37 °C as described by Schultze et 
ah (36). At intervals, the cells were stained with CyChrome-conjugated anti- 
CD^ and PE-conjugated anti-CD80 (B7-1) monoclonal antibodies (Pharmingen) 
and analyzed by flow cytometry. 

12. Human monocvte-derived macrophage and dendritic cell cultures . 

As previously described [Kornbluth], monocytes were isolated from 
PBMC by adherence to fibronectin-coated plates, plated into 48-well plates, and 
then cultured in RPMI 1640 containing 200 DM L-glutamine and 10% autologous 
serum for 7-10 days. Monolayers of the matured cells (about 2 X 10 5 /well), 
termed monocyte-derived macrophages or MDM, were then washed in media and 
cultured in 1 mL/well RPMI 1640 containing 200 DM L-glutamine and 10% heat- 
inactivated FBS. Alternatively, dendritic cells (DC) were formed from 
monocytes by adding GM-CSF and IL-4 to the culture media, and the resulting 
DC were used 6 days later. Preparations of CD40L-SPD were added to the wells 
as indicated. As a positive control, 100 ng/mL bacterial lipopoly saccharide (LPS) 
from E. coli 01 1 1 :B4 (Calbiochem) was added. Supernatants were collected 24 h 
later and analyzed for cytokine content using ELISA (R&D Systems). 

Example 1. 

Design principles in constructing collectin-TNFSF member fusion proteins. 

To express CD40L and other TNFSF members as stable, multimeric 
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proteins, the coding region of the extracellular, C-terminal portion of CD40L was 
joined in-frame to the collectin, surfactant protein D (SPD). The N-terminus of 
SPD contains two cysteines which form the disulfide bonds necessary for the 4- 
armed cruciate structure of the overall molecule [Brown-Augsburger, 1996 #506]. 

5 C-terminal to these cysteines in SPD is a long triple-helical collagenous "stalk" 
which ends in the "neck" region that promotes the trimerization of each arm of the 
structure. Immediately after this neck region, the coding sequence for the 
extracellular portion of CD40L was added, in place of the carbohydrate 
recognition domain (CRD) of SPD. The collectins were chosen as the framework 

io for the multimeric construct because of their multi-subunit structure and the 

trimeric nature of their stalk regions. Appropriateness of replacing the CRD of a 
collectin with the extracellular region of a TNFSF member is further supported by 
structural studies of the two protein families. An analysis of the CRD crystal 
structure of another collectin, ACRP30, indicated that it was structurally 

15 superimposable upon the crystal structures of the extracellular regions of CD40L, 
TNF, and Fas [Shapiro, 1998]. The successful expression of the collectin-TNFSF 
fusion protein, CD40L-SPD, indicates that other TNFSF members (Table 1) could 
be conjoined to SPD in a similar manner and that other collectins besides SPD 
(Table II) could be used as a protein framework instead of SPD. Because these 

20 molecules are formed entirely from naturally occurring proteins, the production of 
an immune response (e.g., antibodies) to these fusion proteins is minimized. By 
deleting portions of the stalk region of the TNFSF proteins, additional constructs 
can be made which may be even less immunogenic. 

25 Example 2. 

Expression of human and murine CD40L-SPD in CHO cells. 

The coding regions for the extracellular portion of human CD40L, human 
T147N-CD40L ,an inactive mutant of CD40L, or murine CD40L were joined to 
the neck region of murine SPD, replacing the SPD CRD (Fig. 1). A CMV-driven 
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expression plasmid for the construct was co-transfected with a DHFR minigene 
into DNFR-deficient CHO cells. Following selection in nucleoside-free media, 
expressing CHO clones were amplified by culture in ascending doses of 
methotrexate. The resulting clones produced about 1-10 |ig/mL of the fusion 
protein over a 3 day period in media containing FBS. 

Clones were adapted for growth as suspension cells in two types of serum- 
free media. Murine CHO-SPD produced in UltraCHO (BioWhittaker) was 
largely retained (about 60% as determined by ELISA) by a 1,000 kDa cutoff 
ultrafiltration membrane (Pall Corp., Port Washington, NY), consistent with a 
large multimeric complex formed by the stacking of the SPD portion of the 
molecule. However, in CHO-S-SFM II (Life Technologies), nearly all ELISA- 
detectable murine CHO-SPD passed through a 100 kDa cutoff ultrafiltration 
membrane (Millipore), suggesting that the protein was either folding incorrectly 
in this media or was being degraded by proteolysis. Consequently, the 
purification method was optimized for the spent UltraCHO media. 

Example 3. 

Purification of human and murine CD40L-SPD 

Purification procedures were developed for murine CD40L-SPD, but the 
same methods could be applied to human CD40L-SPD with minor modifications. 
Murine CD40L-SPD has a predicted m.w. of 49 kDa per chain, or about 600 kDa 
per 12-chain, cruciate molecule, the amino acid sequence predicts a pi of 9.10. 
Accordingly, conditioned media was concentrated by ultrafiltration through a 100 
kDa cutoff filter, which also fractionates the sample on a size basis. After 
diafiltration into 50 mM bicine, pH 9.00 (also containing 1 mM EDTA added to 
inhibit metalloproteinases), the sample was applied to a variety of cationic 
exchange resins. Using Source 30S (Amersham-Pharmacia), most of the ELISA- 
detectable protein did not bind and was recovered in the flow-through. However, 
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as reported by Morris et al. {Morris}, Fractogel S0 3 650M retained the protein. 
The retention by this tentacular resin and not by Source 30S suggests binding to 
positively charged residues that are not on the protein surface. Using a linear 
NaCl gradient, ELISA-detectable protein elutes at between 0.15-0.30 M NaCl 
5 under these conditions (Fig. 2). In selected experiments, the protein was further 
purified using a Superose 6 sizing column. Most of the ELISA-detectable protein 
eluted in the excluded volume, indicating an apparent m.w. of greater than 1,000 
kDa(Fig. 3). 

10 Example 4. 

Activity of CD40L-SPD on human B cells 

Schultze et al described a system using CD40L-expressing cells plus IL-4 
and cyclosporin A (to inhibit T cell growth) as a means to grow very large 
numbers of B cells from a small sample of blood. Because CD40L activates these 

15 B cells to express high levels of B7 molecules (CD80 and CD86), the proliferating 
B cells were effective in presenting peptide antigens and rival non-dividing 
dendritic cells as antigen-presenting cells (APCs) (36). To determine if the 
CD40L-SPD fusion protein could replace CD40L-expressing cells in this system, 
PBMC were cultured with CD40L-SPD in addition to IL-4 and cyclosporin A. 

20 Under these conditions the cells grew to saturation density every three days. After 
three weeks, the cultures were almost entirely CD 19+ B cells which express high 
levels of CD80 (Fig. 4). This indicates that CD40L-SPD can be used in ex vivo 
systems where a soluble yet effective form of CD40L is needed to stimulate cells 
for immunotherapeutic applications. 

25 

Example 5. 

Activity of CD40L-SPD on murine B cells. 

Resting murine B cells are particularly difficult to stimulate with most 
soluble forms of CD40L. Even with murine CD40L-CD8 fusion proteins, it is 

27 



WO 01/42298 PCT/USOO/07380 

necessary to crosslink the protein with antibodies against CD8 in order to achieve 
maximal proliferation in culture [Klauss, 1999]. Accordingly, resting murine B 
cells were negatively selected with immunomagnetic beads. As shown in Fig. 5, 
murine CD40L-SPD was as effective as anti-IgM antibody in driving B cells to 
proliferate. This indicates that CD40L-SPD can mimic the multivalent 
interactions that occur when a responding cell comes in contact with CD40L- 
bearing activating cells. 

Example 6. 

Activity of CD40L-SPD on human macrophages and dendritic cells. 

CD40L is a powerful stimulant for macrophages (reviewed in (28)) and 
dendritic cells (40). Accordingly, preparations of CD40L-SPD were added to 
monocyte-derived macrophages and the production of MIP-1 □ was used as a 
measure of stimulation. As shown in Fig. 6, both human and murine CD40L-SPD 
were able to stimulate macrophages, whereas the T 1 47N-CD40L-SPD mutant was 
inactive as expected. 

DISCUSSION 

These examples define a new method of producing multimeric (i.e., many 
trimers) of CD40L as a fusion protein with SPD. Also prepared and expressed 
were similar fusion proteins between murine RANKL/TRANCE (TNFSF1 1) or 
murine CD27L/CD70 (TNFSF7) joined to murine SPD (data not shown). This 
suggests that virtually all TNFSF members could be successfully produced as 
fusion proteins with SPD. Furthermore, it is also likely that other collectins 
besides SPD could be used in these fusions, given the strong structural 
homologies between the CRDs of the collectins and the extracellular domains of 
TNFSF members [Shapiro] which can be substituted for these CRDs. Given the 
17 known TNFSF members and 9 known collectins, at least 153 fusion protein 
combinations are possible. 
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SPD was selected for initially because it is a soluble homopolymer. Other 
collectins, such as surfactant protein A, have strong binding affinities to lipids and 
specific cell receptors. Although removal of the CRD abrogates much of this 
binding, it may be partially mediated by the neck region sequence, which the 

5 fusion proteins retain. Accordingly, it would be expected that collectins other 
than SPD might confer different cell-binding and pharmacokinetic behaviors upon 
a fusion protein. For example, macrophages are known to take up and degrade 
whole SPD [Dong, 1998]. If a fusion protein other than SPD were used, the 
disposition of the fusion protein in vivo might be altered. Additionally, 

10 metalloproteinases are known to degrade the collectin, Clq , so that a fusion with 
Clq may alter the degradation of the fusion protein. For example, because 
CD40L activates macrophages and other cells to produce metalloproteinases, 
which could potentially degrade the collagenous portion of SPD and other 
collectins. Cleavage of the collagenous stalk would then be expected to release 

15 single-trimers of CD40L, which could diffuse away from the original parent 
molecule, much like a slow-release formulation of a drug. Also, the membrane- 
proximal portion of CD40L has been retained in CD40L-SPD. This sequence also 
contains protease-susceptible amino acid sequences, which can be eliminated by 
mutagenesis to retard the cleavage of CD40L from the fusion protein. Mutations 

20 in such proteinase cleavage site(s) would delay such cleavage and favor the local 
persistence of the CD40L stimulus. 

CD40L-SPD is a large macromolecule (> 1,000 kDa), and the other 
TNFSF-collectin fusion proteins would be expected to be similarly large. For 
25 native SPD, the aggregates that spontaneously form measure 100 nm in diameter. 
When injected into tissue, this large a complex would be expected to remain at the 
injection site for a prolonged period. Localization of the TNFSF-containing 
protein would also be expected to reduce any systemic toxicity caused by the 
release of free single-trimers into the circulation. For example, soluble CD40L in 
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blood has been linked to disease activity in lupus, and this smaller molecule may 
even cross the glomerulus to cause damage to renal tubules [Kato and Kipps, J. 
Clin. Invest. Nov. 1999]. On the other hand, because CD40L induces the 
production of chemokines which attract immune cells [Kornbluth], T cells, 
monocytes, and dendritic cells would be expected migrate to the site where 
CD40L-SPD was injected. This might be advantageous if CD40L-SPD were used 
as a vaccine adjuvant. In mice, soluble CD40L (sCD40LT) stimulates IgGl 
production but not cytotoxic T lymphocytes (CTLs) [Wong, 1999]. Interestingly, 
the same protein that is expressed from an injected plasmid stimulates both a 
strong antibody and CTL response [Gurunathan, 1 998]. In the latter case, the 
plasmid would be expected to deliver a localized supply of CD40L, whereas the 
sCD40LT protein is free to diffuse away. Support for the localized use of CD40L 
in an adjuvant formulation is provided by a study using a plasmid expressing full- 
length membrane CD40L, which was very effective in stimulating both humoral 
and CTL immune responses [Mendoza, 1997]. Similarly, injection of adenovirus 
expressing membrane CD40L has potent antitumor activity in mice [Kikuchi, 
1999]. Similar considerations would likely apply to other fusion proteins between 
the TNFSF and collectins. 

Finally, for immunostimulatory proteins, it is particularly important that 
the protein not be antigenic if repeated injections are needed. For example, 
vaccination with TNF-u modified by the addition of short peptide sequences was 
able to induce the production of disease-modifying anti-TNF-u autoantibodies 
[Dalum, 1999]. Because CD40L-SPD and other TNFSF-collectin fusion proteins 
are formed from endogenous protein sequences (with the possible exception of the 
peptide sequence at the junction), the production of antibodies might not limit the 
effectiveness of repeated injections. 

In conclusion, fusions between TNFSF members and collectins offer a 



30 



WO 01/42298 PCT/USOO/07380 

novel means of generating large protein complexes which can provide localized 
stimulation at an injection site. Because of the multimeric nature of the collectin 
backbone, such fusion proteins may mimic the multivalent ligand surface 
presented by the membrane forms of TNFSF members to TNFRSF-bearing 
5 responding cells. Moreover, by limiting systemic toxicity while maintaining 
localized efficacy, such fusion proteins may have a role as vaccine adjuvants 
against infectious agents and tumors. 
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Table I 

Ligands of the TNF Superfamily* 


PCT/US00/07380 


New Lieand Svmbol 


Other Names 




Genbank ID 






LTA 


Lymphotoxin-, TNF-D, TNFSF1 


X01393 


TNF 


TNF-D, TNFSF2 


X02910 


LTB 


Lymphotoxin-, TNFSF3 


L11016 


TNFSF4 


OX-40L 


D90224 


TNFSF5 


CD40L, CD154. Gp39, T-BAM 


X67878 


TNFSF6 


FasL 


U11821 


TNFSF7 


CD27L, CD70 


L08096 


TNFSF8 


CD30L 


L09753 


TNFSF9 


4-1BBL 


U03398 


TNFSF10 


TRAIL, Apo-2L 


U37518 


TNFSF1 1 


RANKL, TRANCE, OPGL, ODF 


AF013171 


TNFSF12 


TWEAK, Apo-3L 


AF030099 


TNFSF13 


APRIL 


NM 003808 


TNFSF13B 


BAFF, THANK, BLYS 


AF 136293 


TNFSF14 


LIGHT, HVEM-L 


AF036581 


TNFSF15 


VEGI 


AF039390 


TNFSF16 


unidentified 




TNFSF17 


unidentified 




TNFSF18 


AITRL, GITRL 


AF125303 


*(asof*Nov. 1, 1999) 







Known members of ligands in the TNF superfamily, taken from the Human 
Gene Nomenclature Committee at 
http://www.gene.ucl.ac.uk/users/hester/tnftop.htm 
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Table II 
The Collectin Superfamily 



Clq 

Mannose-binding 
protein, MBL1 
Mannose-binding 
protein, MBL2 
Pulmonary surfactant 
protein A 



Pulmonary surfactant 

protein D 

conglutinin 

colIectin-43 

CL-L1 

ACRP30 

Hib27 



All collectins are formed as multimers of trimeric subunits, each containing a 
collagenous domain. The C-terminus of each collectin contains a CRD which 
binds carbohydrates and other ligands. Because of the tight similarities between 
5 the known CRD structures and the extracellular domains of TNFSF members, it is 
likely that the CRD of any collectin could be replaced with the extracellular 
domain of any TNFSF member in a structurally compatible manner. 

While the present invention has now been described in terms of certain 
10 preferred embodiments, and exemplified with respect thereto, one skilled in the 
art will readily appreciate that various modifications, changes, omissions and 
substitutions may be made without departing from the spirit thereof. It is 
intended, therefore, that the present invention be limited solely by the scope of the 
following claims. 
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CLAIMS 

What is claimed is: 

5 1 . A hybrid nucleotide sequence of no more than 1 528 base pairs including a 

sequence defining a structural gene expressing a conjoined single strand of a 
multimeric TNFSF-SPD fusion protein, said structural gene having a nucleotide 
base sequence selected from members of the group consisting of SEQ ID NO 1, 
SEQ ID NO 3 and SEQ ID NO 5. 

10 

2. The DNA segment according to claim 1 , wherein the structural gene 
comprises: 

a segment expressing a single hybrid amino acid chain of TNFSF-SPD, said 
segment having 

is a first SPD nucleotide base sequence of SEQ ID NO 1 , from base 32 

to base 799, and 

a second sequence, expressing a portion of TNFSF stalk, selected 
from members of the group consisting of SEQ ID NO 1, from base 800 to 
base 1444, SEQ ID NO 3, from base 800 to base 1528, and SEQ ID NO 5, 
20 from base 800 to base 1 44 1 . 

3. A recombinant DNA molecule comprising a vector operatively linked to 
an exogenous DNA segment defining a structural gene expressing a single amino 
acid chain of TNFSF-SPD, said structural gene having 

25 a nucleotide base sequence selected from members of the group consisting of 

SEQ ID NO 1, SEQ ID NO 3 and SEQ ID NO 5, any functional equivalents and 
modifications thereof, and 

an appropriate promoter for driving the expression of said structural gene in a 
compatible host organism. 
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4. The recombinant DNA molecule as described in claim 3 wherein said host 
organism is E. coli. 

5 5. The recombinant DNA molecule as described in claim 3, wherein said 
host organism is a yeast. 

6. The recombinant DNA molecule as described in claim 3, wherein said 
host organism is a higher plant or animal. 

10 

7. A multimeric TNFSF-SPD fusion protein comprising: 
a plurality of polypeptide trimers, 

i) a first trimer consisting of peptide strands of members of the TNF 
superfamily (TNFSF) of ligands, and 
is ii) a second trimer strand from a collectin molecule 

each first trimer conjoined to a second polypeptide trimer 
strand from a collectin molecule, wherein said ligand strand is 
substituted for native carbohydrate recognition domains (CRD) of the 
collectin molecules, 

20 said conjoined collectin strands covalently bound in parallel 

to each other, therein forming a multimeric fusion protein 
comprising a plurality of trimeric hybrid polypeptide strands 
radiating from a covalently bound center hub of the molecule, the 
free end of each trimeric radiating strand having a TNFSF moiety 

25 attached. 

8. The multimeric fusion protein according to claim 6, wherein the TNFSF 
moiety is one selected from the group consisting of Iigands LTA, TNF, LTB, and 
TNFSF4 to TNFSF 18 as in Table II, their funtional equivalents, and 
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9. A method for preparing a CD40-SPD multimeric fusion polypeptide, 
comprising the steps of: 

initiating a culture, in a nutrient medium, of procaryotic or eucaryotic host 
cells transformed with a recombinant DNA molecule including an expression 
vector, appropriate for said cells, operatively linked to an exogenous DNA 
segment defining a structural gene for CD40-SPD ligand, said structural gene 
having a nucleotide base sequence of SEQ ID NO 1 from about base 32 to about 
base 1444; and 

maintaining said culture for a time period sufficient for said cells to express 
said multimeric molecule. 

10. A method of producing a secreted, large, biologically active, multimeric 
tumor necrosis factor superfamily ligand fusion protein chimera that is highly 
immunogenic and not readily diffusable, comprising: 

introducing into a host cell a first chimeric DNA construct including 
a transcriptional promoter operatively linked to a first 
secretory signal sequence, followed downstream by, and in proper 
reading frame with 

a first DNA sequence encoding a polypeptide chain of 
a first TNFSF ligand requiring multimerization for biological 
activity, joined to 

a second DNA sequence encoding a collectin 
polypeptide at the site where the collectin 5 s CRD was 
purposefully removed, 
introducing into said host cell a second DNA construct including 

a transcriptional promoter operably linked to a second 
secretory signal sequence followed downstream by, and in 
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proper reading frame with, 

a third DNA sequence encoding a second polypeptide 
chain of a second TNFSF ligand , joined to 

a fourth DNA sequence encoding a collectin 
polypeptide , wherein the collecting CRD was purposefully 
removed; 

growing said host cell in an appropriate growth medium under 
physiological conditions to allow the secretion of a large multimerized 
polypeptide fusion protein, wherein 

said first polypeptide chain of a TNFSF-SPD protein is bound by parallel 
bonding of the respective collectin domain trimer to 

said second polypeptide chain of a different TNFSF-SPD polypeptide 
trimer, and wherein said multimerized polypeptide fusion protein exhibits 
biological activity characteristic of both membrane-attached TNFSFs; and 

isolating said biologically active, multimerized TNFSF-SPD polypeptide 
fusion from said host cell. 

1 1. The method according to claim 10, wherein the chimeric reactant 
compounds are humanized to guard against destruction by a potential human 
recipient's immune system. 

12. A method of preparing a multimeric TNFSF-SPD ligand fusion protein, 
comprising: 

preparing a first DNA segment coding for a strand of an exposed 
extracellular portion of TNFSF; 

preparing a second DNA segment coding for a collectin polypeptide 
strand, wherein said collectin' s CRD domain of the strand has been 
removed; 

conjoining said first and second DNAs in proper reading frame, therein 
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creating a TNFSF-collectin DNA construct; 

inserting said construct into an expression vector system; 

introducing said vector system into an appropriate cell in culture under 
suitable conditions; 
5 harvesting and purifying spent medium from said culture; and 

assaying for presence of multimeric TNFSF-collectin fusion protein. 

13. A method for stimulating the immune response in potentially 
immonocompetent cells using multimeric TNFSF fusion proteins, comprising: 

10 contacting said cells with said multimeric TNFSF fusion proteins, wherein 

said cells are induced to proliferate. 

14. The method according to claim 15, wherein the cells are resting B cells. 

15 15. A method for increasing antigenicity of cells, comprising: 

contacting said cells with said multimeric TNFSF fusion proteins, wherein said 
cells are tumor cells or HIV positive cells. 
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SEQUENCE LISTING 

<110> Kornbluth, Richard S 

<120> Multimeric forms of CD40L and other TNF family members 

<130> TNFSF-collectin fusion proteins 

<140> 60/111,471 
<141> 1998-12-08 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1552 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<220> 

<221> misc_feature 
<222> (88) . . (799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801) . . (1546) 

<223> Human CD40 ligand extracellular region, including 
stalk. 

<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 
<220> 

1 
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<221> CDS 

<222> (32).. (1444) 

<220> 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<300> 

<301> Spriggs, Melanie K. 

Armitage, Richard J. 

Strockbine, L 

Clifford, K N. 

Macduff, B M. 

Sato, T A. 

Maliszewski, C R. 

Fanslow, William C. 
<302> Recombinant human CD40 ligand stimulates B cell 

proliferation and immunoglobulin E secretion. 
<303> J. Exp. Med. 
<304> 176 
<305> 6 

<306> 1543-1550 

<307> 1992 

<313> 801 TO 1600 

<300> 

<301> Motwani, M 
White, R A. 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 

<306> 5671-5677 ' 
<307> 1995 
<313> 32 TO 800 

<400> 1 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 
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ctt gtc ttg ctt 
100 

Leu Val Leu Leu 
10 

age etc teg cag 
148 

Ser Leu Ser Gin 
25 

age cca aca gag 
196 

Ser Pro Thr Glu 
40 

aaa ggt cca egg 
244 

Glu Gly Pro Arg 



ggg etc tea ggg 
292 

Gly Leu Ser Gly 
75 

gag aat ggc tct 
340 

Glu Asn Gly Ser 
90 

agt gga cct cca 
388 

Ser Gly Pro Pro 
105 

ggt ccc tct ggg 
436 

Gly Pro Ser Gly 
120 



gta cag ccc ctg 

Val Gin Pro Leu 
15 

aga tea gta ccc 

Arg Ser Val Pro 
30 

aat ggc ctg cct 

Asn Gly Leu Pro 
45 

ggt gag aag ggt 

Gly Glu Lys Gly 
60 

ttg cag ggc cct 
Leu Gin Gly Pro 

get ggc gaa cct 

Ala Gly Glu Pro 
95 

gga ctt cca ggt 

Gly Leu Pro Gly 
110 

aag cag ggg aac 

Lys Gin Gly Asn 
125 



1 

gga aat ctg gga 
Gly Asn Leu Gly 

aac ace tgc ace 

Asn Thr Cys Thr 
35 

ggt cgt gat gga 

Gly Arg Asp Gly 
50 

gat cca ggt ttg 

Asp Pro Gly Leu 
65 

aca ggt cca gtt 

Thr Gly Pro Val 
80 

gga cca aag gga 
Gly Pro Lys Gly 

att cct ggt cca 

He Pro Gly Pro 
115 

ata gga cct caa 

He Gly Pro Gin 
130 



5 

gca gaa atg aag 

Ala Glu Met Lys 
20 

eta gtc atg tgt 
Leu Val Met Cys 

egg gat ggg aga 

Arg Asp Gly Arg 
55 

cca gga cct atg 

Pro Gly Pro Met 
70 

gga ccc aaa gga 

Gly Pro Lys Gly 
85 

gaa cgt gga eta 

Glu Arg Gly Leu 
100 

get ggg aaa gaa 
Ala Gly Lys Glu 

ggc aaa cca ggt 

Gly Lys Pro Gly 
135 
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cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 
484 

Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 

140 145 150 

caa gga tct aca ggg gca aaa ggc tec aca ggc ccc aag gga gaa aga 
532 

Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 

155 160 165 

ggt gec cct ggt gtg caa gga gec cca ggg aat get gga gca gca gga 
580 

Gly Ala Pro Gly Val Gin Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 " 175 180 

cct gec gga cct gee ggt cca cag gga get cca ggt tec agg ggg ccc 
628 

Pro Ala Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Ser Arg Gly Pro 

185 " 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 
676 

Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly lie Lys 

200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 
724 

Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 

220 225 230 

gee tta aaa gga aaa eta cag cgt eta gag gtt gec ttc tec cac tat 
772 

Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

cag aaa get gca ttg ttc cct gat ggc cat aga agg ttg gac aag ata 
820 

Gin Lys Ala Ala Leu Phe Pro Asp Gly His Arg Arg Leu Asp Lys lie 
250 255 260 

gaa gat gaa agg aat ctt cat gaa gat ttt gta ttc atg aaa acg ata 
868 

Glu Asp Glu Arg Asn Leu His Glu Asp Phe Val Phe Met Lys Thr He 
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265 270 275 

cag aga tgc aac aca gga gaa aga tec tta tec tta ctg aac tgt gag 
916 

Gin Arg Cys Asn Thr Gly Glu Arg Ser Leu Ser Leu Leu Asn Cys Glu 

280 285 290 295 

gag att aaa age cag ttt gaa ggc ttt gtg aag gat ata atg tta aac 
964 

Glu He Lys Ser Gin Phe Glu Gly Phe Val Lys Asp He Met Leu Asn 

300 305 310 

aaa gag gag acg aag aaa gaa aac age ttt gaa atg caa aaa ggt gat 
1012 

Lys Glu Glu Thr Lys Lys Glu Asn Ser Phe Glu Met Gin Lys Gly Asp 
315 320 325 

cag aat cct caa att gcg gca cat gtc ata agt gag gee age agt aaa 
1060 

Gin Asn Pro Gin He Ala Ala His Val He Ser Glu Ala Ser Ser Lys 
330 335 340 

aca aca tct gtg tta cag tgg get gaa aaa gga tac tac ace atg age 
1108 

Thr Thr Ser Val Leu Gin Trp Ala Glu Lys Gly Tyr Tyr Thr Met Ser 
345 350 355 

aac aac ttg gta acc ctg gaa aat ggg aaa cag ctg ace gtt aaa aga 
1156 

Asn Asn Leu Val Thr Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg 
360 365 370 375 

caa gga etc tat tat ate tat gee caa gtc acc ttc tgt tec aat egg 
1204 

Gin Gly Leu Tyr Tyr He Tyr Ala Gin Val Thr Phe Cys Ser Asn Arg 

380 385 390 

gaa get teg agt caa get cca ttt ata gee age etc tgc eta aag tec 
1252 

Glu Ala Ser Ser Gin Ala Pro Phe He Ala Ser Leu Cys Leu Lys Ser 
395 400 405 
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ccc ggt aga ttc gag aga ate tta etc aga get gca aat ace cac agt 
1300 

Pro Gly Arg Phe Glu Arg lie Leu Leu Arg Ala Ala Asn Thr His Ser 
410 415 420 

tec gec aaa cct tgc ggg caa caa tec att cac ttg gga gga gta ttt 
1348 

Ser Ala Lys Pro Cys Gly Gin Gin Ser lie His Leu Gly Gly Val Phe 
425 430 435 

gaa ttg caa cca ggt get teg gtg ttt gtc aat gtg act gat cca age 
1396 

Glu Leu Gin Pro Gly Ala Ser Val Phe Val Asn Val Thr Asp Pro Ser 
440 445 450 455 

caa gtg age cat ggc act ggc ttc acg tec ttt ggc tta etc aaa etc 
1444 

, Gin Val Ser His Gly Thr Gly Phe Thr Ser Phe Gly Leu Leu Lys Leu 

460 465 470 

tgaacagtgt caccttgcag gctgtggtgg agetgacget gggagtcttc ataatacagc 
1504 

acaggcttaa geccaattat acactccaag gcatgtagaa ctggtacc 
1552 



<210> 2 
<211> 471 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<400> 2 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
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35 



40 



45 



Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 " 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 ~ 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 " 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 



His Arg Arg Leu Asp Lys He Glu Asp Glu Arg Asn Leu His Glu Asp 
260 265 270 
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Phe Val Phe Met Lys Thr lie Gin Arg Cys Asn Thr Gly Glu Arg Ser 
275 280 285 

Leu Ser Leu Leu Asn Cys Glu Glu lie Lys Ser Gin Phe Glu Gly Phe 
290 295 300 

Val Lys Asp lie Met Leu Asn Lys Glu Glu Thr Lys Lys Glu Asn Ser 
305 310 315 320 

Phe Glu Met Gin Lys Gly Asp Gin Asn Pro Gin He Ala Ala His Val 

325 330 335 

He Ser Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gin Trp Ala Glu 
340 345 350 

Lys Gly Tyr Tyr Thr Met Ser Asn Asn Leu Val Thr Leu Glu Asn Gly 
355 360 365 

Lys Gin Leu Thr Val Lys Arg Gin Gly Leu Tyr Tyr He Tyr Ala Gin 
370 375 380 

Val Thr Phe Cys Ser Asn Arg Glu Ala Ser Ser Gin Ala Pro Phe He 
385 390 395 400 

Ala Ser Leu Cys Leu Lys Ser Pro Gly Arg Phe Glu Arg He Leu Leu 

405 410 415 

Arg Ala Ala Asn Thr His Ser Ser Ala Lys Pro Cys Gly Gin Gin Ser 
420 425 430 

He His Leu Gly Gly Val Phe Glu Leu Gin Pro Gly Ala Ser Val Phe 
435 440 445 

Val Asn Val Thr Asp Pro Ser Gin Val Ser His Gly Thr Gly Phe Thr 
450 455 460 

Ser Phe Gly Leu Leu Lys Leu 
465 470 



<210> 3 
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<211> 1574 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<223> 5* UTR taken from rat sequence for surfactant 
protein D 

<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1534) 

<220> 

<221> misc_feature 
<222> (32) . . (800) 

<223> Murine surfactant protein D including hub region, 
collagenous portion, and neck, but excluding 
carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801) . . (1534) 

<223> Murine RANKL/TRANCE extracellular region, 
including stalk 

<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani, M 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
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<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 

<300> 

<301> Anderson, D M. 
Maraskovsky, E 
Billingsley, W L. 
Dougall, W C. 

<302> A homologue of the TNF receptor and its ligand enhance 

T-cell growth and dendritic-cell function. 
<303> Nature 
<304> 390 
<305> 6656 
<306> 175-179 
<307> 1997 
<313> 801 TO 1534 

<400> 3 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 

1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 
100 

Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac acc tgc acc eta gtc atg tgt 
148 

Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 
196 

Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 

40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 
244 
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Glu 


Gly 


Pro 


Arg 


Gly 
60 


Glu 


Lys 


Gly Asp 


Pro Gly Leu 
65 


Pro 


Gly 


Pro 
70 


Met 


ggg 

292 


etc 


tea 


ggg 


ttg 


cag 


ggc 


cct 


aca 


ggt 


cca 


gtt 


gga 


ccc 


aaa 


gga 


Gly 


Leu 


Ser 


Gly Leu 
75 


Gin 


Gly 


Pro Thr Gly 
80 


Pro 


Val 


Gly 


Pro 
85 


Lys 


Gly 


gag 
340 


aat 


ggc 


tct 


get 


ggc 


gaa 


cct 


gga 


cca 


aag 


gga 


gaa 


cgt 


gga 


eta 


Glu 


Asn 


Gly 
90 


Ser 


Ala 


Gly 


Glu 


Pro Gly 
95 


Pro 


Lys 


Gly 


Glu Arg 
100 


Gly 


Leu 


agt 
388 


gga 


cct 


cca 


gga 


ctt 


cca 


ggt 


att 


cct 


ggt 


cca 


get 


ggg 


aaa 


gaa 


Ser 


Gly 
105 


Pro 


Pro Gly 


Leu 


Pro Gly 
110 


He 


Pro 


Gly 


Pro 
115 


Ala Gly 


Lys 


Glu 


ggt 
436 


ccc 


tct 


ggg 


aag 


cag 


ggg 


aac 


ata 


gga 


cct 


caa 


ggc 


aaa 


cca 


ggt 


Gly 
120 


Pro 


Ser 


Gly Lys 


Gin 
125 


Gly Asn 


He 


Gly 


Pro 
130 


Gin 


Gly 


Lys 


Pro 


Gly 
135 


cct 
484 


aaa 


gga 


gag 


get 


ggg 


ccc 


aaa 


gga 


gaa 


gta 


ggt 


get 


cct 


ggc 


atg 


Pro 


Lys 


Gly 


Glu 


Ala 
140 


Gly 


Pro 


Lys 


Gly Glu 
145 


Val 


Gly 


Ala 


Pro 


Gly 
150 


Met 


caa 
532 


gga 


tct 


aca 


ggg 


gca 


aaa 


ggc 


tec 


aca 


ggc 


ccc 


aag 


gga 


gaa 


aga 


Gin 


Gly 


Ser 


Thr 
155 


Gly Ala 


Lys 


Gly 


Ser 
160 


Thr 


Gly 


Pro 


Lys 


Gly 
165 


Glu 


Arg 


ggt 
580 


gec 


cct 


ggt 


gtg 


caa 


gga 


gee 


cca 


ggg 


aat 


get 


gga 


gca 


gca 


gga 


Gly 


Ala 


Pro 
170 


Gly Val 


Gin 


Gly Ala 
175 


Pro Gly Asn Ala 


Gly Ala 
180 


Ala 


Gly 


cct 
628 


gec 


gga 


cct 


gec 


ggt 


cca 


cag 


gga 


get 


cca 


ggt 


tec 


agg 


ggg 


ccc 


Pro 


Ala 
185 


Gly 


Pro 


Ala 


Gly 


Pro 
190 


Gin 


Gly Ala 


Pro Gly 
195 


Ser Arg 


Gly 


Pro 
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cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 
676 

Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly lie Lys 

200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 
724 

Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 

220 225 ~ 230 

gee tta aaa gga aaa eta cag cgt eta gag gtt gec ttc tec cac tat 
772 

Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

cag aaa get gca ttg ttc cct gat ggt cga gcg cag atg gat cct aac 
820 

Gin Lys Ala Ala Leu Phe Pro Asp Gly Arg Ala Gin Met Asp Pro Asn 
250 255 260 

aga ata tea gaa gac age act cac tgc ttt tat aga ate ctg aga etc 
868 

Arg lie Ser Glu Asp Ser Thr His Cys Phe Tyr Arg lie Leu Arg Leu 
265 270 275 

cat gaa aac gca ggt ttg cag gac teg act ctg gag agt gaa gac aca 
916 

His Glu Asn Ala Gly Leu Gin Asp Ser Thr Leu Glu Ser Glu Asp Thr 

280 285 290 * 295 

eta cct gac tec tgc agg agg atg aaa caa gee ttt cag ggg gee gtg 
964 

Leu Pro Asp Ser Cys Arg Arg Met Lys Gin Ala Phe Gin Gly Ala Val 

300 305 310 

cag aag gaa ctg caa cac att gtg ggg cca cag cgc ttc tea gga get 
1012 

Gin Lys Glu Leu Gin His lie Val Gly Pro Gin Arg Phe Ser Gly Ala 
315 320 325 

cca get atg atg gaa ggc tea tgg ttg gat gtg gec cag cga ggc aag 
1060 

Pro Ala Met Met Glu Gly Ser Trp Leu Asp Val Ala Gin Arg Gly Lys 

12 
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330 

cct gag gcc cag 
1108 

Pro Glu Ala Gin 
345 

cca teg ggt tec 
1156 

Pro Ser Gly Ser 
360 

ggc tgg gcc aag 
1204 

Gly Trp Ala Lys 



gtt aac caa gat 
1252 

Val Asn Gin Asp 
395 

cat cat gaa aca 
1300 

His His Glu Thr 
410 

gtg tat gtc gtt 
1348 

Val Tyr Val Val 
425 

atg aaa gga ggg 
1396 

Met Lys Gly Gly 
440 

ttt tat tec ata 
1444 

Phe Tyr Ser He 



335 

cca ttt gca cac 

Pro Phe Ala His 
350 

cat aaa gtc act 

His Lys Val Thr 
365 

ate tct aac atg 

He Ser Asn Met 
380 

ggc ttc tat tac 
Gly Phe Tyr Tyr 

teg gga age gta 

Ser Gly Ser Val 
415 

aaa ace age ate 

Lys Thr Ser He 
430 

age acg aaa aac 

Ser Thr Lys Asn 
445 

aat gtt ggg gga 

Asn Val Gly Gly 
460 



etc ace ate aat 

Leu Thr He Asn 
355 

ctg tec tct tgg 

Leu Ser Ser Trp 
370 

acg tta age aac 

Thr Leu Ser Asn 
385 

ctg tac gcc aac 

Leu Tyr Ala Asn 
400 

cct aca gac tat 
Pro Thr Asp Tyr 

aaa ate cca agt 

Lys lie Pro Ser 
435 

tgg teg ggc aat 

Trp Ser Gly Asn 
450 

ttt ttc aag etc 

Phe Phe Lys Leu 
4 65 

13 



340 

get gcc age ate 
Ala Ala Ser He 

tac cac gat cga 

Tyr His Asp Arg 
375 

gga aaa eta agg 

Gly Lys Leu Arg 
390 

att tgc ttt egg 

lie Cys Phe Arg 
405 

ctt cag ctg atg 

Leu Gin Leu Met 
420 

tct cat aac ctg 
Ser His Asn Leu 

tct gaa ttc cac 

Ser Glu Phe His 
455 

cga get ggt gaa 

Arg Ala Gly Glu 
470 
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gaa att age att cag gtg tec aac cct tec ctg ctg gat ccg gat caa 
1492 

Glu lie Ser He Gin Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gin 
475 480 485 

gat gcg acg tac ttt ggg get ttc aaa gtt cag gac ata gac 
1534 

Asp Ala Thr Tyr Phe Gly Ala Phe Lys Val Gin Asp He Asp 
490 495 500 

tgagactcat ttcgtggaac attageggee gctaaactat 
1574 



<210> 4 
<211> 501 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 " 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly He 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 
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Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 " 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

Arg Ala Gin Met Asp Pro Asn Arg He Ser Glu Asp Ser Thr His Cys 
260 265 270 

Phe Tyr Arg He Leu Arg Leu His Glu Asn Ala Gly Leu Gin Asp Ser 
275 280 '285 

Thr Leu Glu Ser Glu Asp Thr Leu Pro Asp Ser Cys Arg Arg Met Lys 
290 295 300 

Gin Ala Phe Gin Gly Ala Val Gin Lys Glu Leu Gin His He Val Gly 
305 310 315 320 

Pro Gin Arg Phe Ser Gly Ala Pro Ala Met Met Glu Gly Ser Trp Leu 

325 330 335 

Asp Val Ala Gin Arg Gly Lys Pro Glu Ala Gin Pro Phe Ala His Leu 
340 345 350 
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Thr He Asn Ala Ala Ser He 
355 



Pro Ser Gly Ser His 
360 



Lys Val Thr Leu 
365 



Ser Ser Trp Tyr His Asp Arg 
370 375 



Gly Trp Ala Lys He 

380 



Ser Asn Met Thr 



Leu Ser Asn Gly Lys Leu Arg 
385 390 



Val Asn Gin Asp Gly 
395 



Phe Tyr Tyr Leu 
400 



Tyr Ala Asn He Cys Phe Arg 

405 



His His Glu Thr Ser 
410 



Gly Ser Val Pro 
415 



Thr Asp Tyr Leu Gin Leu Met 
420 



Val Tyr Val Val Lys 
425 



Thr Ser He Lys 
430 



He Pro Ser Ser His Asn Leu 
435 



Met Lys Gly Gly Ser 
440 



Thr Lys Asn Trp 
445 



Ser Gly Asn Ser Glu Phe His 
450 455 



Phe Tyr Ser He Asn 

460 



Val Gly Gly Phe 



Phe Lys Leu Arg Ala Gly Glu 
465 470 



Glu He Ser He Gin 
475 



Val Ser Asn Pro 
480 



Ser Leu Leu Asp Pro Asp Gin 

485 



Asp Ala Thr Tyr Phe 
490 



Gly Ala Phe Lys 
4 95 



Val Gin Asp He Asp 
500 



<210> 5 
<211> 1477 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD40 ligand 

<220> 
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<221> 5'UTR 
<222> (7) . . (31) 

<223> 5' UTR from rat surfactant protein D 
<220> 

<221> sig_peptide 
<222> (32).. (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1441) 

<220> 

<221> misc_recomb 
<222> (88) . . (799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (800) . . (1441) 

<223> Murine CD40 ligand extracellular region, including 
stalk 

<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani, M 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 
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<300> 

<301> Armitage, R 
Fanslow, W 
Sato, T A. 
Clifford, K N. 

<302> Molecular and biological characterization of a murine 

ligand for CD40 
<303> Nature 
<304> 357 
<305> 6373 
<306> 80-82 
<307> 1992 
<313> 801 TO 1441 

<400> 5 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 
100 

Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu G.y Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac acc tgc acc eta gtc atg tgt 
148 

Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 
196 

Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 
244 

Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 

60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cca gtt gga ccc aaa gga 
292 

Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
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75 

gag aat ggc tct 
340 

Glu Asn Gly Ser 
90 

agt gga cct cca 
388 

Ser Gly Pro Pro 
105 

ggt ccc tct ggg 
436 

Gly Pro Ser Gly 
120 

cct aaa gga gag 
484 

Pro Lys Gly Glu 



caa gga tct aca 
532 

Gin Gly Ser Thr 
155 

ggt gcc cct ggt 
580 

Gly Ala Pro Gly 
170 

cct gcc gga cct 
628 

Pro Ala Gly Pro 
185 

cca gga etc aag 
676 

Pro Gly Leu Lys 
200 



get ggc gaa 
Ala Gly Glu 

gga ctt cca 

Gly Leu Pro 
110 

aag cag ggg 

Lys Gin Gly 
125 

get ggg ccc 

Ala Gly Pro 
140 

ggg gca aaa 
Gly Ala Lys 

gtg caa gga 
Val Gin Gly 

gcc ggt cca 

Ala Gly Pro 
190 

ggg gac aga 

Gly Asp Arg 
205 



80 

cct gga cca 

Pro Gly Pro 
95 

ggt att cct 
Gly lie Pro 

aac ata gga 
Asn lie Gly 

aaa gga gaa 

Lys Gly Glu 
145 

ggc tec aca 

Gly Ser Thr 
160 

gcc cca ggg 

Ala Pro Gly 
175 

cag gga get 
Gin Gly Ala 

ggt gtt cct 
Gly Val Pro 



aag gga gaa 

Lys Gly Glu 
100 

ggt cca*gct 

Gly Pro Ala 
115 

cct caa ggc 

Pro Gin Gly 
130 

gta ggt get 
Val Gly Ala 

ggc ccc aag 
Gly Pro Lys 

aat get gga 

Asn Ala Gly 
180 

cca ggt tec 

Pro Gly Ser 
195 

gga gac aga 

Gly Asp Arg 
210 



85 

cgt gga eta 
Arg Gly Leu 

ggg aaa gaa 
Gly Lys Glu 

aaa cca ggt 

Lys Pro Gly 
135 

cct ggc atg 

Pro Gly Met 
150 

gga gaa aga 

Gly Glu Arg 
165 

gca gca gga 
Ala Ala Gly 

agg ggg ccc 
Arg Gly Pro 

gga ate aaa 

Gly lie Lys 
215 
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ggt gaa age ggg 
724 

Gly Glu Ser Gly 



gec tta aaa gga 
772 

Ala Leu Lys Gly 
235 

cag aaa get gca 
820 

Gin Lys Ala Ala 
250 

gaa gag gaa gta 
868 

Glu Glu Glu Val 
265 

aag aga tgc aac 
916 

Lys Arg Cys Asn 
280 

gag atg aga agg 
964 

Glu Met Arg Arg 



aaa gaa gag aaa 
1012 

Lys Glu Glu Lys 
315 

gat cct caa att 
1060 

Asp Pro Gin lie 
330 

gca tec gtt eta 
1108 

Ala Ser Val Leu 



ctt cca gac agt 

Leu Pro Asp Ser 
220 

aaa eta cag cgt 
Lys Leu Gin Arg 

ttg ttc cct gat 

Leu Phe Pro Asp 
255 

aac ctt cat gaa 

Asn Leu His Glu 
270 

aaa gga gaa gga 

Lys Gly Glu Gly 
285 

caa ttt gaa gac 

Gin Phe Glu Asp 
300 

aaa gaa aac age 
Lys Glu Asn Ser 

gca gca cac gtt 

Ala Ala His Val 
335 

cag tgg gec aag 
Gin Trp Ala Lys 



get get ctg agg 

Ala Ala Leu Arg 
225 

eta gag gtt gec 

Leu Glu Val Ala 
240 

ggc cat aga aga 
Gly His Arg Arg 

gat ttt gta ttc 

Asp Phe Val Phe 
275 

tct tta tec ttg 

Ser Leu Ser Leu 
290 

ctt gtc aag gat 

Leu Val Lys Asp 
305 

ttt gaa atg caa 

Phe Glu Met Gin 
320 

gta age gaa gec 
Val Ser Glu Ala 

aaa gga tat tat 

Lys Gly Tyr Tyr 
20 



cag cag atg gag 

Gin Gin Met Glu 
230 

ttc tec cac tat 

Phe Ser His Tyr 
245 

ttg gat aag gtc 

Leu Asp Lys Val 
260 

ata aaa aag eta 
lie Lys Lys Leu 

ctg aac tgt gag 

Leu Asn Cys Glu 
295 

ata acg tta aac 

lie Thr Leu Asn 
310 

aga ggt gat gag 

Arg Gly Asp Glu 
325 

aac agt aat gca 

Asn Ser Asn Ala 
340 

acc atg aaa age 

Thr Met Lys Ser 
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345 350 355 

aac ttg gta atg ctt gaa aat ggg aaa cag ctg' acg gtt aaa aga gaa 
1156 

Asn Leu Val Met Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg Glu 

360 365 370 " 375 

gga etc tat tat gtc tac act caa gtc acc ttc tgc tct aat egg gag 
1204 

Gly Leu Tyr Tyr Val Tyr Thr Gin Val Thr Phe Cys Ser Asn Arg Glu 

380 385 390 

cct teg agt caa cgc cca ttc ate gtc ggc etc tgg ctg aag ccc age 
1252 

Pro Ser Ser Gin Arg Pro Phe lie Val Gly Leu Trp Leu Lys Pro Ser 

395 400 405 

att gga tct gag aga ate tta etc aag gcg gca aat acc cac agt tec 
1300 

lie Gly Ser Glu Arg lie Leu Leu Lys Ala Ala Asn Thr His Ser Ser 
410 415 420 

tec cag ctt tgc gag cag cag tct gtt cac ttg ggc gga gtg ttt gaa 
1348 

Ser Gin Leu Cys Glu Gin Gin Ser Val His Leu Gly Gly Val Phe Glu 

425 430 435 

tta caa get ggt get tct gtg ttt gtc aac gtg act gaa gca age caa 
1396 

Leu Gin Ala Gly Ala Ser Val Phe Val Asn Val Thr Glu Ala Ser Gin 

440 445 450 455 

gtg ate cac aga gtt ggc ttc tea tct ttt ggc tta etc aaa etc 
1441 

Val lie His Arg Val Gly Phe Ser Ser Phe Gly Leu Leu Lys Leu 

460 465 470 

tgaacagtgc gctgtcctag getgeagcag ggtacc 
1477 



<210> 6 
<211> 470 
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<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD4 0 ligand 

<400> 6 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn lie 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 
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Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly lie Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 " 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

His Arg Arg Leu Asp Lys Val Glu Glu Glu Val Asn Leu His Glu Asp 
260 265 270 

Phe Val Phe lie Lys Lys Leu Lys Arg Cys Asn Lys Gly Glu Gly Ser 
275 280 285 

Leu Ser Leu Leu Asn Cys Glu Glu Met Arg Arg Gin Phe Glu Asp Leu 
290 295 300 

Val Lys Asp lie Thr Leu Asn Lys Glu Glu Lys Lys Glu Asn Ser Phe 
305 310 315 J 320 

Glu Met Gin Arg Gly Asp Glu Asp Pro Gin He Ala Ala His Val Val 

325 330 335 

Ser Glu Ala Asn Ser Asn Ala Ala Ser Val Leu Gin Trp Ala Lys Lys 
340 345 350 

Gly Tyr Tyr Thr Met Lys Ser Asn Leu Val Met Leu Glu Asn Gly Lys 
355 360 365 

Gin Leu Thr Val Lys Arg Glu Gly Leu Tyr Tyr Val Tyr Thr Gin Val 
370 375 380 

Thr Phe Cys Ser Asn Arg Glu Pro Ser Ser Gin Arg Pro Phe He Val 
385 390 395 400 

Gly Leu Trp Leu Lys Pro Ser He Gly Ser Glu Arg He Leu Leu Lys 

405 410 415 



Ala Ala Asn Thr His Ser Ser Ser Gin Leu Cys Glu Gin Gin Ser Val 
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420 425 430 

His Leu Gly Gly Val Phe Glu Leu Gin Ala Gly Ala Ser Val Phe Val 
435 440 445 

Asn Val Thr Glu Ala Ser Gin Val He His Arg Val Gly Phe Ser Ser 
450 455 460 

Phe Gly Leu Leu Lys Leu 
465 470 
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